首页> 外文OA文献 >Temporally Coherent Bayesian Models for Entity Discovery in Videos by Tracklet Clustering
【2h】

Temporally Coherent Bayesian Models for Entity Discovery in Videos by Tracklet Clustering

机译:用于实体发现的时间相干贝叶斯模型   跟踪聚类

摘要

A video can be represented as a sequence of tracklets, each spanning 10-20frames, and associated with one entity (eg. a person). The task of \emph{EntityDiscovery} in videos can be naturally posed as tracklet clustering. We approachthis task by leveraging \emph{Temporal Coherence}(TC): the fundamental propertyof videos that each tracklet is likely to be associated with the same entity asits temporal neighbors. Our major contributions are the first Bayesiannonparametric models for TC at tracklet-level. We extend Chinese RestaurantProcess (CRP) to propose TC-CRP, and further to Temporally Coherent ChineseRestaurant Franchise (TC-CRF) to jointly model short temporal segments. On thetask of discovering persons in TV serial videos without meta-data like scripts,these methods show considerable improvement in cluster purity and personcoverage compared to state-of-the-art approaches to tracklet clustering. Werepresent entities with mixture components, and tracklets with vectors of verygeneric features, which can work for any type of entity (not necessarilyperson). The proposed methods can perform online tracklet clustering onstreaming videos with little performance deterioration unlike existingapproaches, and can automatically reject tracklets resulting from falsedetections. Finally we discuss entity-driven video summarization- where sometemporal segments of the video are selected automatically based on thediscovered entities.
机译:视频可以表示为一系列小轨迹,每个小轨迹跨越10-20帧,并与一个实体(例如,一个人)相关联。视频中\ emph {EntityDiscovery}的任务可以自然地构成为Tracklet聚类。我们通过利用\ emph {Temporal Coherence}(TC)来完成这项任务:每个轨迹的视频的基本属性是,每个轨迹可能与与它的时间邻居相同的实体相关联。我们的主要贡献是在小波级的TC的第一个贝叶斯非参数模型。我们将中文餐厅流程(CRP)扩展为提出TC-CRP,并进一步扩展到临时连贯的中文餐厅特许经营(TC-CRF)以共同对短时间段进行建模。在不使用脚本之类的元数据的情况下在电视连续视频中发现人物的任务上,与最新的小波聚类方法相比,这些方法在聚类纯度和人物覆盖率方面显示出显着提高。我们用混合成分表示实体,并用非常通用的特征向量表示小轨迹,它们可以用于任何类型的实体(不一定是人)。与现有方法相比,所提出的方法可以在视频流上执行在线小波聚类,而性能却几乎不受影响,并且可以自动拒绝由错误检测导致的小波。最后,我们讨论了实体驱动的视频摘要-根据发现的实体自动选择视频的某些时间段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号